NSF PAR Search | NSF Public Access Repository

Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior

Khandelwal, A; Agrawal, A; Bhattacharyya, A; Kumar, Y; Singh, S; Bhattacharya, U; Dasgupta, I; Petrangeli, S; Shah, R R; Chen, C; et al. (May 2024, 12th International Conference on Learning Representations (ICLR 2024))

An Assure AI Bot (AAAI bot)

https://doi.org/10.1109/ISNCC55209.2022.9851759

Tellez, N; Serra, J; Kumar, Y; Li, J; Morreale, P (July 2022, Proceedings of ISNCC)

Artificial Intelligence (AI) bots are widely used in industrial manufacturing and even in store cashier applications. Our research trains AI bots to act as software engineering assistants, specifically to detect biases and errors in AI software applications. An example application is a machine learning system that sorts and classifies people according to various attributes, such as the algorithms used in criminal sentencing, hiring, and admissions. Biases, unfair decisions, and failures of equity, diversity, and justice in such systems could have severe consequences. As a Hispanic-Serving Institution, we are concerned about underrepresented groups and have devoted considerable time to implementing the “An Assure AI” (AAAI) Bot to detect biases and errors in AI applications. Our state-of-the-art AI Bot builds on our previous research in AI and Deep Learning (DL). Its key differentiator is the approach: instead of cleaning or filtering the input data to minimize its biases, we trained our deep Neural Networks (NN) to detect and mitigate biases in existing AI models. The backend of our bot uses the Detection Transformer (DETR) framework, developed by Facebook,
Validation of AI models for ITCZ Detection from Climate Data

https://doi.org/10.1109/DSIT55514.2022.9943879

Serra, J; Fortes, S; Tellez, N; Allaico, A; Landaverde, E; Quezada, R; Kumar, Y; Li, J J; Morreale, P (July 2022, Proceedings of 2022 5th International Conference on Data Science and Information)

This paper presents an innovative testing framework, testFAILS, designed for the rigorous evaluation of AI Linguistic Systems, with a particular emphasis on various iterations of ChatGPT. Leveraging orthogonal array coverage, this framework provides a robust mechanism for assessing AI systems, addressing the critical question, "How should we evaluate AI?" While the Turing test has traditionally been the benchmark for AI evaluation, we argue that current publicly available chatbots, despite their rapid advancements, have yet to meet this standard. However, the pace of progress suggests that achieving Turing test-level performance may be imminent. In the interim, the need for effective AI evaluation and testing methodologies remains paramount. Our research, which is ongoing, has already validated several versions of ChatGPT, and we are currently conducting comprehensive testing on the latest models, including ChatGPT-4, Bard and Bing Bot, and the LLaMA model. The testFAILS framework is designed to be adaptable, ready to evaluate new bot versions as they are released. Additionally, we have tested available chatbot APIs and developed our own application, AIDoctor, utilizing the ChatGPT-4 model and Microsoft Azure AI technologies.